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Abstract. The motivation for this paper is to study the complexity of constant-width arithmetic 
circuits. Our main results are the following. 
r—{ ' 1. For every k > 1, we provide an explicit polynomial that can be computed by a linear-sized 

monotone circuit of width 2k but has no subexponential-sized monotone circuit of width k. 
It follows, from the definition of the polynomial, that the constant-width and the constant- 
depth hierarchies of monotone arithmetic circuits are infinite, both in the commutative and 
the noncommutative settings. 

We prove hardness-randomness tradeoff's for identity testing constant-width commutative 
circuits analogous to [KI03IDSY08] . 
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Using a rank argument, Nisan, in a seminal paper [N91J, showed exponential size lower bounds 
for noncommutative formulas (and noncommutative algebraic branching programs) that com- 
pute the noncommutative permanent or determinant polynomials in the ring ¥(X), where 
X = {xi, • • • , x n } are noncommuting variables. 

By Ben-Or and Cleve's result [BC92] . we know that bounded- width arithmetic circuits 
(both commutative and noncommutative) are at least as powerful as formulas (indeed width 
three is sufficient). Can we extend Nisan's lower bound arguments to prove size lower bounds 
for noncommutative bounded-width circuits? Motivated by this question we make some simple 
motivating observations in this section. We first recall some basic definitions. 
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Definition 1. \N91^RS05^ An Algebraic Branching Program (ABP) over a field F and vari- 
ables x\, X2, ■ ■ ■ , x n is a layered directed acyclic graph with one source vertex of indegree zero 
and one sink vertex of outdegree zero. Let the layers be numbered 0, 1, • • • , d. Edges only go 
from layer i to i+ 1 for each i. The source and sink are the unique layer and layer d vertices, 
respectively. Each edge in the ABP is labeled with a linear form overF in the input variables. 
The size of the ABP is the number of vertices. Each source to sink path in the ABP computes 
the product of the linear forms labeling the edges on the path, and the sum of these polynomials 
over all source to sink paths is the polynomial computed by the ABP. 

The scalars in an ABP can come from any field F. If the input variables X = 
{xi,X2, ■■■ ,x n } are noncommuting then the ABP (or circuit) computes a polynomial in the 
free noncommutative ring ¥(X). If the variables are commuting then the polynomial computed 
is in the ring F[X]. 

Definition 2. An arithmetic circuit over F and variables xi,x%, ■ ■ ■ ,x n is a directed acyclic 
graph with each node of indegree zero labeled by a variable or a scalar constant. Each internal 
node g of the DAG is labeled by + or x (i.e. it is a plus or multiply gate) and is of indegree 
two. A node of the DAG is designated as the output gate. Each internal gate of the arithmetic 



circuit computes a polynomial (by adding or multiplying its input polynomials). The polyno- 
mial computed at the output gate is the polynomial computed by the circuit. The circuit is 
said to be layered if its vertices are partitioned into vertex sets V\ U U . . . U Vt such that V% 
consists only of leaves, and given any internal node ginVi for i > 1, the children of g are 
either nodes from V\ (consisting of constants or variables) or nodes from the set V{-\. The 
size of a circuit is the number of nodes in it, and the width of a layered circuit is maxj>i \ Vi\. 
An arithmetic circuit over the field R is monotone if all the scalars used are nonnegative. 
Finally, a layered arithmetic circuit is staggered if, in each layer i with i > 1, every node 
except possibly one is a product gate of the form g = u x 1 , for some gate u from the previous 
layer. 

Note that the notion of bounded (i.e, constant) width staggered circuits of width w is 
identical to the notion of a straight-line program with w registers. The following lemma shows 
that staggered circuits of width w are comparable in power to width w — 1 (not necessarily 
staggered) arithmetic circuits. It holds in the commutative and the noncommutative settings. 
We postpone the proof of the lemma to the Appendix. 

Lemma 1. Given any layered arithmetic circuit C of width w and size s computing a poly- 
nomial p, there is a staggered arithmetic circuit C of width at most w + 1 and size 0(ws) 
computing the same polynomial. 

A seminal result in the area of bounded width circuits is due to Ben-Or and Cleve [BC92 
where they show that size s arithmetic formulas computing a polynomial in ¥[X] (or in ¥(X) 
in the noncommutative case) can be evaluated by staggered arithmetic circuits of width three 
and size 0(s 2 n). Bounded width circuits have also been studied under various restrictions 
in [LMR07 MR08 JR09J. However, they have not considered the question of proving explicit 
lower bounds. 

What is the power of arithmetic circuits of width 2? It is easy to see that the width-two 
circuit model is universal. We state this (folklore) observation. 

Proposition 1. Any polynomial of degree d with s monomials in F[xi, X2, ■ ■ ■ ,x n ] (or in 
¥(x\, • • • , x n ) ) can be computed by a width two arithmetic circuit of size 0(d-s). Furthermore, 
any monotone polynomial (i.e, with non-negative real coefficients) can be computed by a width 
two monotone circuit over R of size 0(d ■ s). 

Some Observations 

To motivate the study of constant-width circuits, we point out that, for the problem of 
proving lower bounds for noncommutative bounded width circuits, Nisan's rank argument is 
not useful. For the noncommutative "palindromes" polynomial P{xq, x\) = Y^we{x Xl y n ww R , 
the communication matrix M n (P) is of rank 2 n and hence any noncommutative ABP for it is 
exponentially large [N91]. However, we can give an easy width-2 noncommutative arithmetic 
circuit for P(xo,xi) of size 0(n). Indeed, we can even ensure that each gate in this circuit is 
homogeneous. 

Proposition 2. The palindromes polynomial P(xq,xi) has a width-2 noncommutative arith- 
metic circuit of size 0(n). 
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What then is a good candidate explicit polynomial that is not computable by width-2 
circuits of polynomial size? We believe that the polynomial P[ (of Section [2]) for suitable 
k is the right candidate. A lower bound argument still eludes us. However, if we consider 
monotone constant- width circuits then even in the commutative case we can show exponential 
size lower bounds for monotone width-fc circuits computing P^. Since Pi is computable by 
depth 2k arithmetic circuits (of unbounded fanin), it follows that the constant- width and 
the constant-depth hierarchies of monotone arithmetic circuits are infinite. We present these 
results in Section [2j 

Remark 1. Regarding the separation of the constant-depth hierarchy of monotone circuits, 
we note that a separation has also been proved by Raz and Yehudayoff in [RY09] ; their lower 
bounds show a super polynomial separation between the power of depth k multilinear circuits 
and depth k+1 monotone circuits for any k (sec [RY09J for the definition and results regarding 
multilinear circuits). In contrast, our separation works only for monotone circuits, and only 
for infinitely many k. Nonetheless, we think that our separation is interesting because the 
separation we achieve is stronger. More precisely, the results of [RY09] show a separation of 
the order of 2( logs ) 1+fi<1 k) (that is, there is a polynomial that can be computed by circuits 
of depth k + 1 and size s but not by depth k circuits of size 2( logs ) 1+fi<1 k ). On the other 
hand, our separation is at least as large as 2( logs ) c for any c > (see Section [2] for the precise 
separation) . 

A related question is the comparative power of noncommutative ABPs and noncommuta- 
tive formulas. Noncommutative formulas have polynomial size noncommutative ABPs. How- 
ever, s °( lo s s ) is the best known formula size upper bound for noncommutative ABPs of size s. 
An interesting question is whether we can prove a separation result between noncommutative 
ABPs and formulas. We note that such a separation in the monotone case follows from an 
old result of Snir [S80j . 

Proposition 3. Consider two noncommuting variables {xo,xi}. Let L denote the set of 
all monomials of degree 2n with an equal number of xq and x\, and consider the polynomial 
E G Q{x ,xi), where E = ^2 weL w. 

1. There is a monotone homogeneous ABP for E of size 0(n 2 ). 

2. Any monotone formula computing E is of size n Q ^ n \ 

Proof. The first part is directly from a standard 0(n 2 ) size DFA that accepts precisely the 
set L = {w € {xq, x\} 2n | w has an equal number of xo's and xi's}. The second part follows 
from the fact that such a monotone formula would yield a commutative monotone formula 
for the symmetric polynomial of degree n over the variables J/i,2/2> ■ • ■ , 2/2n : this is obtained 
by first observing that the formula must compute homogeneous polynomials at each gate. 
Furthermore, we can label each gate (and each leaf) by a triple (i,j, d) where j — i+ 1 = d is 
the degree of the homogeneous polynomial computed at this gate such that each monomial 
generated at this gate will occupy the positions from % to j in the output monomials containing 
it. Hence we have xo's at the leaf nodes labeled by triples (i,i, 1) for all 2n values of i. We 
replace the xo's labeled 1) by yi and each x\ by 1. The resulting formula computes 
the symmetric polynomial as claimed. Snir in [S80] has shown a tight n^ logri ) lower bound 
for monotone formulas computing the symmetric polynomial of degree n over the variables 
2/1,2/2, • • • ,2/2n- ■ 
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To illustrate again the power of constant width circuits, we note that there is, surprisingly, 
a width-2 circuit for computing the polynomial E. 

Proposition 4. There is a width-2 circuit of size for computing E if the field F has at 

least en 2 distinct elements for some constant c. 

Proof Sketch. This is based on the well-known Ben-Or trick [B80] for computing the symmetric 
polynomials in depth 3. We consider the polynomial g(xo,x\, z) = (xoz 2> * +1+1 + x\z + l) 2k+1 , 
where 2 fc_1 < n < 2 k . (z will eventually be a scalar from F.) The coefficient of z ( 2k+1 + l ) n + n 
in g is precisely the polynomial E. Following Ben-Or's argument, the problem of recovering 
the polynomial E can be reduced to solving a system of linear equations with an invertible 
coefficient matrix. Hence E can be expressed as a sum E = Pi9( x 0i x ii z i)i where the 

ZiS are all distinct field elements. The terms fyg^xo, x\, Zi) can be evaluated with one register 
using repeated squaring of xqz 2 + +1 + x%Zi + 1. The second register is used as an accumulator 
to compute the sum of these terms. ■ 

These observations are additional motivation for the study of constant-width arithmetic 
circuits. In Section [2] we prove lower bound results for monotone constant-width circuits. In 
Section [3] we explore the connection between lower bounds and polynomial identity testing 
for constant-width commutative circuits analogous to the work of Dvir et al [DSY08J . 

2 Monotone constant width circuits 

In this section we study monotone constant-width arithmetic circuits. We prove that they 
form an infinite hierarchy. As a by-product, the separating polynomials that we construct 
yield the consequence that constant-depth monotone arithmetic circuits too form an infinite 
hierarchy. All our polynomials will be commutative, unless we explicitly state otherwise. 
For positive integers k and £ we define a polynomial P^ on £ 2k variables as follows: 

p((x 1 ,x 2 ,. . . ,x £2 ) = Yi=i nj=i 

Pi+l{ x li x 2, ■ ■ ■ , x £2k+2) = Ya=1 Ili=l P k( x (i-l)e 2h + 1 +(j-i)£ 2h +li ■ ■ ■ i X (i-l)t^+ 1 +jt 2 '') 

An easy inductive argument from the definition gives the following. 

& — i 

Lemma 2. The "polynomial P^ is homogeneous of degree i k on £ 2k variables and has £ l ~ 1 
distinct monomials. 

By definition, P| can be computed by a depth 2k monotone formula of size 0(£ k ). Further- 
more, we can argue that the polynomials P^ are the "hardest" polynomials for constant-depth 
circuits. We make this more precise in the following observation. 

Proposition 5. Given a depth k arithmetic circuit C of size s, there is a projection reduction 
from C to the polynomial P^ where £ = 0(s 2k ). 

Proof Sketch. We sketch the easy argument. We can transform C into a formula. Furthermore, 
we can make it a layered formula with 2k alternating + and x layers such that the output gate 
is a plus gate. This formula is of size at most s 2k . Clearly, a projection reduction (mapping 
variables to variables or constants) will transform to this formula, for £ = 0(s 2k ). ■ 

It is easy to see the following from the fact that a monotone depth 2k arithmetic circuit 
of size s can be simulated by a monotone width 2k circuit of size O(s). 
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Proposition 6. For any positive integers £ and k there is a monotone circuit of width 2k 
and size 0{£ 2k ) that computes 

We now state the main lower bound result. For each k > there is Iq € Z + such that 
for all £ > £q any width k monotone circuit for P^ is of size Q(2 l ). We will prove this result 
by induction on k. For the induction argument it is convenient to make a stronger induction 
hypothesis. 

For a polynomial / € F[X], where X = {xi,X2, ••• , x n } let mon(/) = {m \ m is a nonzero 
monomial in /}. I.e. mon(/) denotes the set of nonzero monomials in the polynomial /. Also, 
let var(/) denote the set of variables occurring in the monomials in mon(/). Similarly, for 
an arithmetic circuit C we denote by mon(C) and var(C) respectively the set of nonzero 
monomials and variables occurring in the polynomial computed by C . 

We call a layered circuit C minimal if there is no smaller circuit C of the same width 
s.t mon(C) = mon(C'). It can be seen that for any monotone circuit C, there is a minimal 
circuit C of the same width s.t mon(C") = mon(C) and has the following properties. 

— The only constants used in C are and 1. Furthermore, no gate is ever multiplied by a 
constant. 

— By the minimality of C every node g in C has a path to the output node of C . Hence, 
given any node g in C computing a polynomial p, there is a monomial m such that 
mon(m • p) C mon(C'). In particular, this implies that if C computes a homogeneous 
multilinear polynomial, then p must be a homogeneous multilinear polynomial. 

— If C' computes a homogeneous multilinear polynomial of degree d, and if a node g in layer 
i also computes a polynomial p of degree d, then in layer i + 1, there is a sum gate g' 
such that g is one of its children. Thus, the gate g' computes a homogeneous multilinear 
polynomial p' of degree d such that mon(p) C mon(p'). In particular, mon(p) C mon(C'). 

We call a minimal circuit satisfying the above a good minimal circuit. We now show a 
useful property of minimal circuits C, which applies to circuits satisfying mon(C) C P%, for 
all f , > 1. 

Lemma 3. Let f = J2i=i Pi be a homogeneous monotone polynomial of degree d > 1 with 
var(Pi) fl var(Pj) = for all i ^ j. Given any good minimal circuit such that mon(C) C 
mon(f), we have the following: if a gate g in C computes a polynomial p of degree less than 
d, or a product of two such polynomials, then var(p) C var(Pi) for a unique i. 

Proof. For any polynomial q G ¥[x±,X2, - ■ ■ ,x n ] we can define a bipartite graph G{q) as 
follows: one partition of the vertex set is m.on(q) and the other partition var(g). A pair {x, m} 
is an undirected edge if the variable x occurs in monomial m. It is clear that the graph G(f) 
is just the disjoint union of all the G{Pi). 

If the polynomial p computed by gate g is of degree d! < d, then, since C is good, there 
is a monomial m of degree d! — d such that mon(m • p) C mon(C) C mon(/). This implies 
that G(m • p) is a subgraph of G(f). On the other hand, G{m ■ p) is clearly seen to be a 
connected graph. This implies that, in fact, G(m ■ p) is a subgraph of G(Pi) for some i and 
hence, var(p) C var(Pj) for a unique i. This proves the lemma in this case. 

Similarly, if p is a product of two polynomials of degree less than d, then G(p) is a connected 
graph, and by the above reasoning, it must be the subgraph of some G(Pi). Hence, the lemma 
follows. ■ 
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We now state and prove a stronger lower bound statement. It shows that P^ is even hard 
to "approximate" by polynomial size width-/c monotone circuits. 

Theorem 1. For each k > there is io £ Z + such that for all I > t§ and any width-k 
monotone circuit C such that 

mon{C) C mon(Pl) and \mon{C)\ > — - — , 

i/ie circuit C is of size at least |g . 

Proof. Let us fix some notation: given i € Z + and j £ [w] , we denote by gij the jth node in 
layer i of C and by »• the polynomial computed by gij. Also, given a set of monomials M, 
we say that a circuit C\ computes M if mon(Ci) 2 -W. 

Without loss of generality, we assume throughout that C is a good minimal circuit. The 
proof is by induction on k. The case k = 1 is distinct and easy to handle. Thus, we consider 
as the induction base case the case k = 2. Consider a width two monotone circuit C such that 
mon(C) C mon(P|) an d |mon(C)| > |mon(P|)|/2 = £ i+1 /2. Let / denote the polynomial 
computed by C. By Lemma [2] both / and P| are homogeneous polynomials of degree d = £ 2 . 

We write the polynomial P| as Si=i -f«> where var(Pj) = {^(j_iV3 +1 , . . . , x^a}. Note that 
var(Pj) n var(Pj) = for i ^ j. Let / = Y2i=i where mon(P 4 ') C mon(p) for each i. 

Since C is good and / is homogeneous, each gate of C computes only homogeneous 
polynomials. Moreover, since mon(C) C mondPg) an d var(Pj)(~lvar(Pj) = for i ^ j, LemmaH 
implies that given any node g in C that computes a polynomial p of degree less than d or a 
product of such polynomials satisfies var(p) C var(Pj) for one i. Consider the lowest layer (iq 
say) when the circuit C computes a degree d monotone polynomial. W.l.o.g assume that /j 0j i 
is such a polynomial. We list some crucial properties satisfied by ft 0) i and C. 

1. By the minimality of iq, the node gi i is a product gate computing the product of poly- 
nomials of degree less than d. Hence, var(/j i) C var(Pj) for exactly one i. W.l.o.g , we 
assume i = 1. Since deg(/j , 1) = d and C is good, we in fact have mon(/j 0i i) C mon(Pi). 

2. Since deg(/j 0) i) = d and C is good, we know that there is a node gi +ij lQ+1 that is a 
sum gate with <?j 0) i as child; gi 0+ ij i +1 computes a homogeneous polynomial of degree d 
and mon(/i 0+ i Jio+1 ) D mon(/j 0j i). Iterating this argument, we see that there must be a 
sequence of nodes g%^ for % > iq such that for each i, gi t j i is a sum gate with gi-ij l _ 1 
as child, such that mon(/i 0j i) C mon(/ !0+ i Jio+1 ) C mon(/i 0+2 ,j l0+2 ) . . ., and each f itji is a 
homogeneous polynomial of degree d. We assume, w.l.o.g, that ji = 1 for each i > iq. 

By the choice of io, note that the node gj 0i 2 either computes a polynomial of degree less 
than d or computes a product of polynomials of degree less than d. Hence, var(/j .2) Q var(Pj) 
for some i. If i > 1, we assume w.l.o.g. that var(p) C var(P2)- Let us consider the circuit C 
with the variables in var(Pi) Uvar(P2) set to 0. The polynomial computed by the new circuit 
C is now /' = / — P[ — P' 2 = J2i=3 P(- -^ e ^ denote the new polynomial computed by the 
node gij. Note that each qi j is now a constant polynomial. 

Consider the monotone circuit C" obtained from C as follows: we remove all the gates 
below layer io; the gate <?j 0i 2 in layer i$ is replaced by a product gate c x 1, where c is the 
constant it computes in C; from layer io onwards, all nodes of the form gn are removed; in 
any edge connecting nodes g^i and the node g^i is replaced by the constant 0. Clearly, 

C" is a width 1 circuit. For ease of notation, we will refer to the nodes of C" with the same 
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names as the corresponding nodes in C . For any node 5^2 in C" (i > io), let q' i2 De the 
polynomial it now computes. Crucially, we observe the following from the above construction. 

Claim 2. For each i > io, mon^ 2 ) D mon{qi^) \ mon(q,^i). 

We now finish the proof of the base case. Define a sequence i\ < %2 < ■ ■ ■ < it of layers 
as follows: for each j € [t], ij is the least i > ij-\ such that mon^i) 2 mon (Qij- 1,1)1 an d 
mon((7j ti i) = mon(/'). Clearly, t is at most the size of C. Note that it must be the case 
that q ijt i = qij-1,1 + 1,2- Hence, we have mon^i) = mon(g ij ._i j i) U mon(g ij ._i )2 ) = 
m o n (ftj_i,i) U (mon(^._ 1;2 ) \ mon(gj j ._ lj i)). By the above claim, the set mon(g ij _i i2 ) \ 
mon(g'j j _i i i), which we will denote by Sj, can be computed by a width- 1 circuit. Thus, 
mon(/') = mon(gj ti i) = mon((ft 0) i) U U* = i where each Sj can be computed by a width-1 
circuit. Since qi (h i is the zero polynomial, we have mon(/') = Uj=i 

Now, consider any width-1 monotone circuit computing a set S C P|. It is easy to see 
that the set S computed must have a very restricted form. 

Claim 3. The set S is of the form mon(p) where p = EieXi x i) Y\jex 2 x v an< ^ ^1(1X2 = 0. 

Clearly, as each set Sj satisfies Sj C var(P/) for some i, it can have at most £ 3 monomials. 
Therefore, if the monotone circuit C is of overall size less than 2 then it can compute a 
polynomial of the form P± + P 2 + /', where /' has at most 2^£ 3 monomials. Since |mon(P')| < 
|mon(Pj)| = £ e for each i, we have for suitably large t 

|mon(C)| < 2/ + 2^ < 3* < ^ = 
and the base case follows. 

The induction step. 

Consider any monotone circuit C of width k — 1 such that mon(C) C mon(P^_ 1 ) and 
|mon((7)| > |mon(P|_ 1 )|/2. As induction hypothesis we assume that C must be of size at 
least 27l0. 

Let Pi = Yfi=i^ii w ith var(Pj) = {xu_]\pk+i +1 , . . . ,x i( 2k+i} as in the base case. By 
definition, the I variable sets var(Pj) are mutually disjoint and each Pj has degree d = £ k . 
It is convenient to also write Pi = Y\.j=iQiji where each Qij is of type P^_ v We have 
var(Qij) = {a;(j-i)^2fc+i + ( J -_ 1 )^2fc +1 , . . . ,x^_^pk+i + jpk}. 

We start by considering any width k — 1 circuit C of size less than 2^/10 such that 
mon((7) C mon(P|). For any i E [£], by fixing all the variables outside var(Pj) to 0, we obtain 
a width k—1 circuit Cj of the same size s.t mon(Cj) C mon(Pj). Further, by setting all the vari- 
ables outside var(Qjj) to 1 for some j G [£}, we obtain a circuit Cij s.t mon(CVj) C mon(Qjj). 
By the induction hypothesis, we see that |mon(CV,-)| < |mon(Qy )|/2. Clearly mon(C'j) C 
mon(C'ii) x mon(Ci2) x . . . x mon(Cj£). Therefore, |mon(Q)| < FJ . |mon(C'jj)| < |mon(p)|/2 £ . 

Finally, as mon((7) = {jj^mon^i), |mon((7)| < Y2i |nion(Cj)| < |mon(P^)|/2^. We have estab- 
lished the following claim. 

Claim 4. For any width k — 1 circuit C of size less than 2 e /10 such that mon(C) C mon(P^), 

r 1 /Am ^ \mon(Pf)\ 
we have \mon(C)\ < t k . 



7 



For the induction step, consider any monotone width-A; circuit C such that mon(C) C 
mon(P^) and of size at most 2^/10. We will show that |mon(C)| < |mon(Pn|/2. W.l.o.g, we 
can assume that C is a good minimal circuit. Let / denote the polynomial computed by C; 
we write / = Yli=i where mon(P/) C mon(Pj) for each i. 

As in the base case, let io be the first layer where a polynomial of degree d is computed. 
W.l.o.g. we can assume that /j 0i i is such a polynomial. By the minimality of io, the node 
gi (h i must be a product node with children computing polynomials of degree less than d. 
This implies, as in the base case, that var(/j i) C var(Pj) for a unique i. W.l.o.g. we assume 
that i = 1. As before, we can fix a sequence of nodes gij i for each % > %q such that gij t 
is a sum gate with gi-i t j i _ 1 as a child. It is easily seen that mon(/j 0j i) C mon(/j 0+ ij io+1 ) C 
mon(/j 0+ 2j 4o+2 ) . . ., and each fij i computes a homogeneous polynomial of degree d. Renaming 
nodes if necessary, we assume j{ = 1 for all i. 

Now consider f^j for j > 1. By the minimality of io, we see that each /j j is either a 
polynomial of degree less than d or a product of two such polynomials. Hence, var(/j 0J ) C 
var(P s ) for some s 6 [f]. Thus, there is a set S C [£] s.t |5| = k' < k such that Uj>i vav (fiod) — 
\J seS var(P s ). Without loss of generality, we assume that those s € S that are greater than 1 
are among {2, 3, ... , k}. 

Consider the circuit C obtained when each of the variables in Use[fc] var (-fs) * s se t to 0. 
Let qij be the polynomial computed by gjj in C'. The polynomial computed by C is just 
f' = f — Y2 s e[k] P's- Note that qi j is now simply a constant for each j, and that the size of 
C is at most the size of C which by assumption is bounded by 2^/10. Using this size bound 
we will argue that C cannot compute too many monomials. 

We now modify C as follows: we remove all the gates below layer ioj each gate gi j with 
j > 1 is replaced by a product gate of the form c x 1 where c is the constant <7i 0i i computes 
in C'; from layer io onwards, all nodes of the form g^i are removed; in any edge connecting 
nodes gn and gi+ij for j > 1, the node gn is replaced by the constant 0. Call this new circuit 
C". Clearly, C" has size at most the size of C and width at most k — 1. For ease of notation, 
we will refer to the nodes of C" with the same names as the corresponding nodes in C . For 
any node gij in C" (i > io and j > 1), let q[ j be the polynomial it now computes. As in the 
base case, we observe the following from the above construction. 

Claim 5. For each i > io and each j > 1, mon{iq\ ■) D mon(qi t j) \ mon(qi y i). 

Using this, we show that the circuit C was essentially just using the gates gn to store 
the sum of polynomials computed using width k — 1 circuits. 

Construct a sequence of layers i% < i% <...<% in C as follows: for each j £ [t], ij is 
the least i > ij-i such that mon^i) 2 m on(q , j :i _ li i), and mon((/j tj i) = mon(/'). Surely, t is 
at most the size of C . Now, fix any ij for j > 1. Clearly, it must be the case that = 
Qij-1,1 + Qij -i,a f° r some s > 1; therefore, we have mon(g iji i) C mon(g, r i i i) U (mon(g ij ._i iS ) \ 
mon^.-i^)). Denote the set mon(q , j j ._i jS ) \ mon(gj j ._i i i) by Sj. Since the above holds for all 
j, and mon(g ij _i i i) = mon(g^_ li i), we see that mon(/') = mon(g i(i i) C mon(g i(]i i) U Uj Sj = 
\Jj Sj, since qi 0) i is the zero polynomial. 

We will now analyze \Sj\ for each j. By the above claim, there is a width k — 1 circuit 
C" of size at most the size of C such that Sj C mon(C") C P|. If the size of C (and hence 
that of C and C") is at most 2 € /10, it follows from ClaimHthat \Sj\ < |mon(P^)|/2 £ . Hence, 
we see that |mon(/')| < t|mon(P^)|/2^, which is at most |mon(P^)|/10. But we know that 
the polynomial / computed by the circuit C is of the form /' + X^iG[fc] ^ii w here |mon(P/)| < 



S 



|mon(Pf) 



|mon(Pj)| = [mon(Pf)|/£. Therefore, 

k ( k 1 \ 

|mon(/)| < -|mon(P^)| + |mon(/')| < |mon(P£)| (- + — ]< 

for large enough £. This proves the induction step. ■ 

For k G Z + and c > let Depth fc c and Widthfc )C denote the set of families {f n }n>o of 
monotone polynomials f n G M[xi,X2, • • • , x n ] computed by c • n c -sized monotone circuits of 
depth k and width k respectively. For k G Z + , let Depth fc = U c >o Depths c and Widths = 
U c>0 Widthfe )C . Thus, Depth fc and Widths denote the set of families of monotone polynomials 
computed by poly(n)-sized monotone circuits of depth k and width k respectively. Note that, 
for each k G Z + we have Depth fc C Widths . Moreover, from the definition of Pi, we see 

that the family {P fc } n G Depth 2 /%. Finally, in Theorem [T] we have shown that the family 

n l/2fc I 

{Pj: } n ^ Widths, for constant k. Hence, we have the following corollary of Theorem [TJ 

Corollary 1. For any fixed k G Z + , Width/, C Widths and Depth k C Depth^^. 

Theorem [1] can also be used to give a separation between the power of circuits of width 
(respectively, depth) k and fc + 1 for infinitely many k. We now state this separation. For any 
k G N and any function / : N — > N, let us denote by f k the k-th iterate of /, i.e the function 
/ o f o . . . o f. Given non-decreasing functions /, g : N — > N, call / a sub 1/k-th iterate of g 

v ' 

k times 

if f k (n) < g(n), for large enough n (closely related notions have been defined in |Sz61j and 
|RR97j). It can be verified that sub 1/fc-th iterates of exponential functions can grow fairly 
quickly: for example, for any e > and any k, c G N, the function 2^ logra ^ c is a sub 1/k-th 
iterate of T 1 " . 

We now state the precise separation that can be inferred from the above theorem. For any 
k,n G N with k > 2 and any polynomial p £ IR[xi,X2, . . . ,x n ], let Wk(p) (resepctively dfe(p)) 
denote the size of the smallest monotone width A; (respectively depth k) circuit that computes 
p. 

Corollary 2. There is an absolute constant a > such that the following holds. Fix any 
k G N where k > 2. Also, fix any non- decreasing function f : N — > N i/iaf is a su6 1/k-th iterate 

1 /2fe 

of2 an . Then, for large enough n, there is a monotone polynomial p G R[xi, X2, • • • , x n ] such 
that for some k' , k" G {k, k + 1, . . . , 2k — 1}, Wk'ip) > f(uik'+i(p)) and dk"(p) > f(dk"+i(p))- 

n l/2fe I 

Proof Let p denote the monotone polynomial P fc G M[xi,X2, ■ ■ ■ , x n \. Theorem [T] tells 
us that u>k{p) = i7(2L" 1/2fc J ). To obtain a lower bound on dk(p), note that any polynomial 
computed by a circuit of size s and depth k can be computed by a width k circuit of size 
0(s fc ); this tells us that dk(p) = 2 n ^ n \ Hence, there is some constant (3 > such that 
min{wk (p) , dk (p) } > 2@ nl/2k , for large enough n. 

By definition, p = P fc has a depth 2/c circuit of size 0(n), i.e d2k(p) = 0(n). Propo- 
sition [6] tells us that W2k(p) = 0(n) also. Hence, for some constant 7 > and large enough 
n, we have m&x{w2k(p), d2k(p)} < 7"-- 

The above statements imply that Wk{p) > g(u>2k(p)) and d^ip) > 5(<^2fc(p))j where g(n) = 

1 /2k 

2 an for some constant a > and n is large enough. Now, fix any non-decreasing function 
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/ : N — ► N that is a sub l/fc-th iterate of g. We see that Wk(p) > g(w2k{p)) > f k (w2k(p)) 
for large enough n; clearly, this implies that for some k! G {k, k + 1, . . . , 2k — 1}, we must 
have Wtfip) > f{wk'+i{p))- Similarly, there is also a k" G {k, k + 1, . . . , 2k — 1} such that 
dk"(p) > f(dk"+i(p))- ■ 

Similar corollaries hold for noncommutative circuits too. We define the polynomial in 
exactly the same way in the noncommutative setting. Note that any monotone bounded width 
noncommutative circuit computing P k automatically gives us a monotone commutative circuit 
of the same size and width computing the commutative version of P& . Hence, the lower bound 
of Theorem Q] also holds for noncommutative width- A; circuits. For k G Z + , let ncDepth fc and 
ncWidthfc denote the set of families of monotone polynomials {f n G M(xi,X2, ■ ■ ■ ,x n ) \ n G 
Z + } computed by poly(n)-sized monotone (noncommutative) circuits of depth k and width 
k respectively. Analogous to the commutative case, we obtain the following. 

Corollary 3. For any fixed k G Z + , ncWidth^ C ncWidth^h o,nd ncDepth k C ncDepth^^. 

And finally, we observe that the separations between width and depth k and k + 1 that 
hold in the commutative monotone case also hold in the noncommutative monotone case. 
Define, for any k,n G N with k > 2 and any polynomial p G M.(x±,X2, ■ ■ ■ ,x n ), let ncwk(p) 
(resepctively ncdk(p)) denote the size of the smallest monotone width k (respectively depth 
k) circuit that computes p. We have the following. 

Corollary 4. There is an absolute constant a > such that the following holds. Fix any 
k G N where k > 2. Also, fix any non- decreasing function f : N — > N that is a sub 1/k-th iterate 

l/2k 

of 2 an . Then, for large enough n, there is a monotone polynomial p G M(xi,X2, ■ ■ ■ ,x n ) 
such that for some k', k" G {k, k + 1, . . . , 2k — 1}, ncwk>(p) > f(ncwk'+i(p)) an d ncdyip) > 
f{ncd k „ +1 (p)). 

3 Identity testing for constant width circuits 

In this section we study polynomial identity testing for constant-width commutative circuits. 
Impagliazzo and Kabanets [KI03J showed that derandomizing polynomial identity testing is 
equivalent to proving arithmetic circuit lower bounds. Specifically, assuming that there are 
explicit polynomials that require super polynomial size arithmetic circuits, they use these 
polynomials in a Nisan-Wigderson type "arithmetic" pseudorandom generator that can be 
used to derandomized polynomial identity testing. This idea was refined by Dvir et al [DSY08] 
to show that if there are explicit polynomials that require super polynomial size constant-depth 
arithmetic circuits then polynomial identity testing for constant-depth arithmetic circuits can 
be derandomized (the precise statement involves the depth parameter explicitly [DSY08J). 

In this section we prove a similar result showing that hardness for constant-width arith- 
metic circuits yields a derandomization of polynomial identity testing for constant-width cir- 
cuits. We say that a family of multilinear polynomials {P n }n>o where P n (x) G ¥[x\, ■ ■ ■ ,x n ] 
is explicit if the coefficient of each monomial m of the polynomial P n can be computed in 
time 2 n ° (1) . 

Recall the notion of a staggered arithmetic circuit (Definition [2]) . 

Lemma 4. Let f G F[xi, X2, • • • , x n ] of degree m be computed by a staggered arithmetic circuit 
of size s and width w. Then Hi(f) (the i th homogeneous component of f) can be computed by 
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a staggered circuit of size poly(s,m) and width w + 0(1), provided F has at least deg(/) + 1 
many elements. 

Proof. Define a new polynomial g(x, z) G F[xi, £2, • • • , %n, z] as g(x, z) = f(x\z, X2Z, ■ ■ ■ , x n z). 
We can write f(x\Z, x^z, ■ ■ ■ , x n z) = Y^i=o Hi{f)z % where m = deg(/) and Hi(f) is the 
homogeneous part of /. Let {zq, z\, ■ ■ ■ , z m } be m + 1 distinct field elements. Consider the 



;tli 



matrix M defined as 



M 



/IzqzI 
1 zi z\ 

\\ z, 



in z m 



m ] 



We have the system of equations 

M(HoW,Hx(f),---,H m (f)) T 



(g(x,z Q ),g(x,zi) 



■ ■ ,g{x,z m )) T . 
G F such that H^f) 



Since M is invertible, it follows that there are scalars aij 
EJLo aijgix^j). 

Since f(x) has a width w circuit of size s, g(x, z) clearly has a (staggered) circuit of width 
w + 0(1) of size 0(s). It follows easily from the above equation for Hi(f) that each Hi(f) 
has a circuit of width w + O(l) and size 0(ms). ■ 



Lemma 5. Let P(x\, X2, • • • , x n , y) be a polynomial, over a sufficiently large field F, computed 
by a width w staggered circuit of size s. Suppose the maximum degree of y in P is r. Then for 
each j the j th partial derivative Q~ can be computed by a staggered circuit of width w + 0(l) 

and size (rs) ^. 

Proof. Let P(x,y) = J2l=o Ci(x)y l . As in Lemma H] each Ci(x) can be computed by a width 
iv + 0(1) staggered circuit of size 0(rs). Clearly, for each j the polynomial can be written 

as ^ 

' j^aijdix)^-^ 
i=j 

for aij G F, where a^j are field elements that depend only upon j. Therefore, we can easily 
give a staggered circuit of size 0(r 2 s) and width w + 0(1) for each polynomial ■ 



dyi 



The following lemma is proved in |DSY08| . For any polynomial g S F[xi,X2, • • • ,x n ] let 



Lemma 6. [DSY08, Lemma 3.2] Let P € ¥[xi,X2,---,x n ,y] and deg y (P) = r. Suppose 
f G F[xi, X2, • • • , x n ] such that P(x, f(x)) = and ^{0, /(O)) is equal to £ 7^ 0. Let P(x, y) = 
Y^i=i Ci(x)y l . Then for each k > there is a polynomial Qk G F[yo, yi, • • • , y r ] such that 

H<k(f) = H<k{Qk(Co, Ci, ■ ■ ■ , C r )). 

Using the above lemmata we prove our first theorem. 

Theorem 6. Let P G F[xi, X2, • • • , x n , y] and deg y (P) = r > 1 such that P has a stag- 
gered circuit of size s and width w. Suppose that P(x,f(x)) = for some polynomial 
f G F[xi,X2, • • • ,x n ] with deg(/) = m. Then f has a staggered circuit of size poly(s, (m + r) r ) 
and width w + 0(1) if char (F) > r and F is sufficiently large. 
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Proof. First we argue that we can assume w.l.o.g., as in Dvir et al |DSY08j . that fj^(0> /(0)) = 

£ 0. If ^-(x, f(x)) = we can replace P by Since char(¥) > r it is easy to see that 

there exists j : 1 < j < r such that ^£-(x, f(x)) ^ 0. Hence, we can assume ^{x, f(x)) ^ 0. 

Therefore, there is an a G F n such that ^P(a, /(o)) 7^ 0. We can assume that a = by 
appropriately shifting P as in [DSY08]. Let 

r 

P(x,y)=Y,C l (x)y\ 

i=l 

By Lemma M there is a polynomial Qfc G F[yo,---,yr] such that H<k(f) = 
H<k(Qk(Co, C\, ■ ■ ■ , C r )) for each < k < m. Putting k = m and letting Q m = Q we 
have /(3?) = H< m (Q(C , d, • • • , C r )). 

Let y* = (C o (0), • • • ,C r (0)) and deg(Q) = M. Define Im = {(«o, «i, • • • , «r) I 
a i € N, < M}. By expanding the polynomial Q at the point y* we get Q(y) = 

Thus, we can write 

r 

/(a?) = F< w [ £ Qa JJ(C*(a?) - 3(0))°*]. 

As the constant term of Ciix) — Cj(0) is zero, if we consider n[=i(Ci(X) ~~ Cj(0)) ai for some 
a with a.i> m then we will get monomials of degree more than m whose net contribution 
to f(x) must be zero. Hence we can write f(x) as 

r 

f(x) = H< m [ Qa]J(Ci(x) - Ci(0)n 

ael m 1=0 

where I m = {(ao, 011, ■ ■ ■ , a r ) \ a% G N, ^ aj < m}. Clearly, \I m \ < (m + r) r . Now, 
the polynomial ni=o(y* ~~ J/*)"* ^ as a simple 0(l)-width circuit C. We can compute 
n[=o(Ci(^) ~~ Ci(0)) ai by plugging in the staggered width w + 0(1) circuit for Ci(x) (as 
obtained in Lemma|5| where yi is input to C . Thus, we obtain a circuit of width w + 0(l) for 
^ael Q a rii=o(^»(^) ~~ Cj(0)) Qi that is of size polynomial in s and (m + r) r . By Lemma [4] 
we can compute its homogeneous components and their partial sums with constant increase 
in width. Putting it together, it follows that fix) can be computed in width w + 0(1) of size 
polynomial in s and (m + r) r . ■ 

We apply Theorem [6] to prove the main result of this section. 

Theorem 7. There is a constant c\ > so that the following holds. Suppose there is an 
explicit sequence of multilinear polynomials {-P m } m >o where P m (x) G ¥[xi, ■ ■ ■ ,x m ] and P m 
cannot be computed by arithmetic circuits of width w + c\ and size 2 mt , for constants w G 
Z + and e > 0. Then, for any constant C2 > 0, there is a deterministic 2( lo § ri ) 0(1> • b°^ 
time algorithm that, when given as input a circuit C of size and width w computing a 

polynomial f(x±, X2, ■ ■ ■ , x n ) of maximum coefficient size b, with each variable of individual 
degree at most (logn) C2 , checks if the polynomial computed by C is identically zero, assuming 
that the field F is sufficiently large and char(¥) > (logra) C2 . 
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Proof Sketch. The overall construction is based on the Nisan-Wigderson construction as 
applied in Impagliazzo-Kabanets [KI03] and Dvir et al [DSY08]. Hence it suffices to sketch 
the argument. 

1. Let m = (log n) c '( e,C2 ) and £ = (log n) c "( e ' C2 ^ where c" is suitably larger than d ' . 

2. Construct the Nisan-Wigderson design Si, ■ ■ ■ , S n C [£] such that \S{\ = m for each i and 
\Si (~1 Sj\ < logn. 

3. Consider the polynomial F(yi,y 2 , ■ ■ ■ , ye) = C(P m (y\Si), P m (y\S 2 ), ■ ■ • , P m (y\S n )). For 
any input y G F f we can evaluate F by evaluating P m (y\Si) for each i and then evaluating 
C on the resulting values. Since the P m are explicit polynomials and \S%\ has polylog(n) 
size we can evaluate P m in time 

2 (logn)°« 

4. We test if F(y) = using a brute-force algorithm based on the Schwartz-Zippel lemma. 
Consider a finite set S C F, such that |5| is more than deg(i ? ). Check if F(a) = for all 
a G S 1 ^ in time n°^\ If all the tests returned zero then return C = otherwise C ^ 0. 

The proof of correctness is exactly as in [KI03 DSY08J. Assuming the algorithm fails, after 
hybridization and fixing variables in C, we get a nonzero polynomial F 2 of the form 

F 2 (y\S i+ i,x i+ i) = F 1 (P m (y\S 1 n S i+1 ), P m (y\S 2 n Si+i), • ■ • ,P m (y|5i n S i+1 ),x i+1 ). 

where Fi(x\,x 2 , . . . ,Xi + \) can be computed by a width ty circuit of size poly(n) and 
F 2 (y\Si + i, P m (y\Si + i)) = 0. Note that the multilinear polynomials P m (y\Sj fl Si+i) depend 
only on logn variables and hence, they can be computed using brute force width-2 stag- 
gered circuits of size O(nlogn). Also, by Lemma HJ we know that F\ can be computed by 
a staggered circuit of size poly(n) and width at most w + 1. Putting the above circuits to- 
gether, it is easy to see that F 2 can be computed by a staggered circuit C of size at most 
poly(n).nlogn = poly(n) and width w + O(l). Now, by applying Theorem[6]to C , we get a 
circuit of width w + 0(1) to compute P m contradicting the hardness assumption. ■ 

Finally, we observe that the following analogue of |KI03t Theorem 4.1] holds for bounded 
width circuits. The proof is in the appendix. 

Proposition 7. One of the following three statements is false. 

1. NEXP C P/poly. 

2. The Permanent polynomial is computable by polynomial size width w arithmetic circuit 
over Q, where w is a constant. 

3. The identity testing problem for bounded width arithmetic circuits over Q is in NSUBEXP. 
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Appendix 



Proof Sketch of Lemma [JJ 

The circuit C is constructed by showing how to compute, for i > 1, the polynomials 
computed in layer i + 1 of C from the polynomials computed in the ith layer in C in a 
staggered fashion, using at most w layers of width at most w + 1. Equivalently, it amounts to 
designing a straight-line program with w + 1 registers such that: initially, w of the registers 
contain the polynomials computed in the w nodes of the i th layer. In the end, w of the w + 1 
registers will contain the polynomials computed at the i + 1 st layer of C. Note that this is 
trivial for i = 2 since all nodes in layer 2 have only leaves as children. For some i > 1, let the 
U denote the nodes of C in layer i and V the nodes of C in layer i + 1. 

We define an undirected multigraph G corresponding to layers i and i + 1 as follows: its 
vertex set V(G) is U. For each gate v € V in circuit C that takes inputs ui, u% £ U we include 
the edge {u±,U2} in E(G). Notice that if u\ = 112 we add a self-loop to E(G). Furthermore, 
if v £ V takes one input as a u £ U and the other inputs is a constant or a variable, then 
too we add a self-loop at vertex u. Finally, if both inputs to v are constants and/or variables, 
there is no edge in G corresponding to v. We note some properties of this graph G. 

1. We have |V(G)| < w and \E(G)\ + \V'\ < w, where V is the set of those nodes in V that 
take only constants and/or variables as input. 

2. Each vertex u E V(G) corresponds to a polynomial p u computed at u in the i th layer. 
Each edge e £ E(G) is defined by some v E V and it corresponds to the polynomial 
q e computed at v. In order to compute the polynomial corresponding to e we need the 
polynomials corresponding to its end points. 

We have w + 1 registers, w of which contain the polynomials p u ,u £ U. Our goal is to 
compute the polynomials q e , e € E(G) using these registers. Using the graph structure of G, 
we will give an ordering of the edges E(G). If we compute the polynomials q e in that order 
then for every q e computed we will have a free register to store q e (when we do not need a 
polynomial p u for further computation, we can free the register containing p u ). 

Thus, what we want to do is compute an ordering of the edges -E(Crjl] from the vertex set 
V(G). 

We pick edges from E[G) one by one. When e € E(G) is picked, we delete e from the 
graph and store q e in a free register. Crucially, note that when a vertex u € V(G) becomes 
isolated in this process the polynomial p u is not required for further computation and the 
register containing p u is freed. Thus, at any point of time in this edge-deletion procedure, the 
number of registers required is equal to the sum of the number of edges removed from G and 
the number of non-isolated vertices left in G. 

The edge picking procedure works as follows. We break G into its connected components 
Gi U Gi U . . . U G s U G s+ i U . . . U G s+ t, where Gi, G2, ■ ■ ■ , G s are the acyclic components and 
G s +i, . . . , G s+ t have cycles. We first compute the edges of G\, and then those of G2, and so 
on. At the end, we compute the polynomials corresponding to the nodes in V'. 

Each connected component Gi is processed as follows: if there is an edge e in Gi that is 
not a cut edge, we pick the edge e and delete it from the graph; otherwise, since every edge of 
Gi is a cut edge, Gi must be a tree, and in this case, we remove any edge e that is incident to 
a degree-1 vertex. Proceeding thus, we maintain the invariant that at all points, all but one 

1 We can blur the distinction between vertices and edges and the polynomials they represent. 
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of the components of Gj are isolated vertices. We can use this to show that the number of 
registers required at any point in the computation of q e for e G E{G{) is at most \E{Gi) \ + 1 
(in particular, if Gj is acyclic this is at most |V(Gj)|). 

Putting it all together, we can also show that the maximum number of nodes used in 
computing the edges of G is bounded by max{|U(G)|, \E(G)\ + 1, \E{G)\ + \V'\} < w + 1. 
Moreover, since at each step the polynomial of some node v £ V is computed, the total 
number of steps in the straight-line program is at most w. This proves the lemma. 

Proof of Proposition [7] 

The proof follows the same lines as that of [Kl03j Theorem 4.1]. A similar result for 
bounded- depth circuits is noted in [ DSY08 . Section 5]. We give a brief proof sketch. Assume to 
the contrary that all three statements hold. Following the proof in [KI03], NEXP will collapse 
to NP Perm . Hence, it suffices to show p Perm c NSUBEXP to derive a contradiction (to the 
nondeterministic time hierarchy theorem). The language Perm consists of all tuples (M,v), 
where M is an integer matrix and v is the binary encoding of Perm(M). The NSUBEXP 
machine will guess a polynomial size, width- w circuit G for the n x n Permanent polynomial 
over variables {xy|l < i,j < n}. Next, we want to check whether G indeed computes the 
Permanent polynomial. We can easily obtain a width-it; polynomial-sized circuit G& that 
computes the permanent of k x k matrix over variables {x^j 1 1 < i,j < k} from circuit G. 
Next we check whether B\ = C\(x) — x = 0. For n > k > 1 check that B k = C k (X^) - 

Y2i=i x l,%Gk-l{x\ ) = 0, where = (xi,j)i,ke[k] is the k x k matrix and is a minor 
obtained by deleting first row and i th column of X^ k \ It follows that if all the B^s are 
identically zero polynomials then G computes the Permanent polynomial. Since G^ has a 
width-w polynomial size circuit it follows that B^ can be computed by a polynomial-size width 
w + 0(1) circuit. We can now use the assumed deterministic sub exponential time algorithm 
for identity testing of bounded width circuits to check whether each B^ is identically zero. 
Putting it together, we have p Perm C NSUBEXP. 
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